Self-Stabilizing Autonomic Recoverer
نویسندگان
چکیده
We suggest to model software package flaws (bugs) by assuming eventual Byzantine behavior of the package. In particular, the package has been tested by the manufacturer for limited length scenarios when started in a predefined initial state; the behavior beyond the tested scenario may be Byzantine. Restarts (reboots) are useful for recovering such systems. We suggest a general yet practical framework and paradigm, based on a theoretical foundation, for the monitoring and restarting of systems. An autonomic recoverer that monitors and restarts the system is proposed, where: The autonomic recoverer is designed to handle different tasks given specific task requirements in the form of predicates and actions. DAG subsystem hierarchy structure is used by a consistency monitoring procedure in order to achieve gracious recovery. The existence and correct functionality of the autonomic recovery is guaranteed by the use of a kernel resident (anchor) process, and the design of the process to be self-stabilizing. The autonomic recoverer uses the new scheme for liveness assurance via on-line monitoring that complements known schemes for on-line ensuring safety.
منابع مشابه
T D C C S-s T P *
This paper advocates the use of self-stabilization as a provable property to achieve the goals of the self-* paradigms for systems, including availability, reliability, serviceability, disaster recovery and autonomic computing. Several recent results starting from hardware concerns, through the operating system, and ending in the applications are integrated: the selfstabilizing microprocessor [...
متن کاملSelSta - A Biologically Inspired Approach for Self-Stabilizing Humanoid Robot Walking
In this paper we elaborate a study on self-stabilizing humanoid robot that achieves run-time self-stabilization and energy optimized walking gait pattern parameters on different kinds of flat surfaces. The algorithmic approach named SelSta uses biologically inspired notions that introduce robustness into the self-stabilizing functionality of the humanoid robot. The approach has been practically...
متن کاملPreserving the Fault-Containment of Ring Protocols Executed on Trees
Reliable and fault-tolerant distributed systems have been attracting more and more attention (see Autonomic Computing Project by IBM, http://www-03.ibm.com/autonomic/). A self-stabilizing protocol is a fault-tolerant protocol that guarantees autonomous recovery from any number of and any type of faults that can affect the data stored locally at some process(es). If the impact of the faults can ...
متن کاملOn Self-stabilizing Systems: an Approach to the Speciication and Design of Fault Tolerant Systems
A self-stabilizing system is one which can recover from transient faults in a nite number of steps. We present a theory for determining if a behavior speciication can be satissed with a self-stabilizing system and if not, then what the largest self-stabilizing subset and smallest self-stabilizing superset of the speciication is. The eeects of self-stabilizing behavior on the control of discrete...
متن کاملSafe and Eventually Safe: Comparing Self-stabilizing and Non-stabilizing Algorithms on a Common Ground
Self-stabilizing systems can be started in any arbitrary state and converge to exhibit the desired behavior. However, self-stabilizing systems can be started in predefined initial states, in the same way as non-stabilizing systems. In this case, a self-stabilizing system can mask faults just like any other distributed system. Moreover, whenever faults overwhelm the systems beyond their capabili...
متن کامل